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SHOETEE AETICLES AND DISCUSSION 

A SIMPLE TEST OP THE GOODNESS OF FIT OF 
MENDELIAN RATIOS 

In actual experimentation the so-called Mendelian ratios, 
3 : 1, 9 : 3 : 3 : 1, 9 : 3 : 4, 9 : 7, 15 : 1, 27 : 9 : 9 : 9 : 3 : 3 : 3 : 1, etc., are 
never exactly realized because of the errors of sampling inherent 
in all statistical work. Notwithstanding this fact, the best the- 
oretical formulae must be selected on the basis of these mislead- 
ing experimental results. 

Now the test of the validity of any Mendelian formula is two- 
fold: the number of individuals found should agree with the 
number expected within the limits of experimental error, 1 the 
assumed germinal composition of the several groups of individ- 
uals should be capable of substantiation from a study of the 
soma of their offspring. 

For the most part, Mendelians have been satisfied to judge 
the goodness of fit of the theoretical frequency to the empirical 
by inspection merely. More recently, however, attempts have 
been made to apply scientific tests to this problem. The first 
was that of Weldon, 2 but Professor Johannsen doubtless de- 
serves the credit of having interested the few Mendelian workers 
who have taken the pains to calculate probable errors in this 
indispensable part of their work. 

The test used by Professor Weldon and recommended in a 
much extended form by Professor Johannsen 3 is essentially the 
determination of the probable error of the number of individ- 
uals in one of the subgroups by the formula 



0. = V»XPX4 

where p is the chance of occurrence of an individual of any 
class, q = 1 — p, and n is the ntunber of individuals. Thus the 

1 In some eases, valid reasons for discrepancy between calculated and 
observed frequencies may be shown. These factors should then be taken 
into account in calculating the theoretical numbers. 

2 Weldon, W. F. E., "Mendel's Laws of Alternative Inheritance in Peas," 
Biometrilea, 1 : 228-254, 1902, especially pp. 233-234. 

s Johannsen, W v "Elemente der Exakten Erblichkeitslehre, " pp. 402- 
410, 1909. 
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"probable error" of the number of individuals of any class, 
say p, is 

Ep = .67449 \/npq. 

Now while Professor "Welclon's use of this formula for the 
simple 3 : 1 ratios seems quite proper, the same can not be said 
for Professor Johannsen's generalization. This is true for three 
reasons : 

(a) The formula is valid only when neither n, p nor q is 
small. In polyhybrid ratios p or q may be relatively small. 4 
It is then quite idle to use the probable error suggested, unless 
n be large, which unfortunately is generally not the case. 

(b) Even when p is not so low as to render the use of the 
conventional formula for the probable error open to question, 
it is very laborious to calculate the probable errors for the fre- 
quency of each class. 5 

(c) It is not only cumbersome and laborious, but theoretically 
unjustified to test the validity of a given ratio by the determina- 
tion of the probable error of one or of all of its individual com- 
ponent groups. The random deviations of the class frequencies 
are not independent, but correlated. "We must have a usable 
criterion of the goodness of fit of the theory to the data as a 
whole. 

Such a criterion was furnished several years ago by Pearson. 6 
Its applicability to the problem of testing the goodness of fit of 
Mendelian ratios seems obvious, but since, as far as I can ascer- 
tain, it has nowhere been applied to this problem, it seems 
worth while to call the attention of students of genetics to its 
usefulness. 

where o is observed frequency of any class, c is calculated fre- 
quency on the basis of Mendelian theory and S indicates a 
summation for the several classes distinguishable in the ratio 
under consideration. 

P, a measure on the scale of to 1 of the probability that 

4 For example, Johannsen (loo. cit., p. 405) tables values for p = 3/4, 
q= 1/4 to p = 63/64, q — 1/64. 

5 See, for instance, the example given by Johannsen, loc. cit., p. 396. 

e Pearson, K., "On the Criterion that a Given System of Deviations 
from the Probable in the Case of a Correlated System of Variables is Such 
that it Can be Reasonably Supposed to have Arisen from Eandom Sampling, ' ' 
PMl. Mag., 50: 157-175, 1900. 
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the deviations from the theoretical frequencies may be reason- 
ably supposed to be due to the errors of sampling, may be cal- 
culated from x 2 ^oy formulae which need not concern us here, 
since its values for systems of frequency of 3-30 classes have 
been tabled. 7 Hence the Mendelian has only the simple task of 
calculating x 2 and looking up the value of P in Elderton's tables. 
Illustrations will make method of computation and usefulness 
most clear. 



Illustration I. Doubleness and Plastid Color in Stocks 
Saunders, Joum. Gen., 1: 349-350, 1911 




Obs. 


Cale. 


o-c 


(0-c)2 


(0-C)2/ 




1,666 
773 
790 


1,615 
807 
807 


51 
-34 
-17 


2,601 

1,156 

289 


1.611 


Doubles, White 


1.433 




.358 








3,229 


3,229 







X 2 = 3.402 



Whence, from the tables in Biometrika, and by interpolation, 
n' = 3, x 2 = 3, P = . 223130 



n' = 3, 



-A, P = . 135335 



Diff. = .087795 



P = .223130 — .087795 X .402 =■ .1878. 

Thus only in about one case in five would the errors of sampling 
lead to divergences from theory as bad as this. The theory is, 
as far as this evidence goes, possible, but certainly not demon- 
strated. 



Illustration II. Seed Form and Color in Pisum 
Bateson and Killby, Report Evol. Com., 2: 77, 1905 



Round, Yellow. . . 
Wrinkled, Yellow. 
Round, Green . . . . 
Wrinkled, Green. . 



Obs. 



4,926 
1,656 
1,621 

478 



Cale* 



4,883 

1,628 

1,628 

542 



+43 
+28 
- 7 
-64 



(o-e)^c 



.3787 

.4816 

.0301 

7.5572 



x 2 = 8.4476, P=.0384. Thus taking the data as they stand, 
it is impossible to regard the 9:3:3:1 ratio as satisfactorily 

' Pearson, loo. tit., gives a small table. A much more comprehensive one 
is given by W. Palin Elderton, "Tables for Testing the Goodness of Pit of 
Theory to Observation," Biometrika, 1: 155-163, 1901. 

8 These are not the calculated frequencies given by Bateson and Killby,. 
but have been recalculated as closely as possible on the 9:3:3:1 ratio. 
Theirs are nine seeds short. 
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describing the facts. But the great factor in the magnitude 
of x 2 is the deficiency in the wrinkled green seeds, and the 
authoi's have suggested a reasonable biological explanation for 
this deficiency. 

Illustration III. Color in Oats 
Nillson-Ehle, fide Baur. Einf. Exp. Vererlungsl., pp. 66-67 



Schwarzspelzig 
Grauspelzig. . . 
Weissspelzig . . 



Obs. 



418 

106 

36 



Calc. 



420 

105 

35 



o-c ' 



_2 

+ 1 
+ 1 



(o-c'Wc 



.0095 
.0095 
.0286 



Thus x 2 = -0476 only. P is not tabled for x 2 < 1, since the 
probabilities of such deviations being due simply to errors of 
sampling are so enormously high. Theory and observation 
could hardly agree more perfectly. 

Illustration IV. Body Color in Dr< 
Morgan, Journ. Exp. Zool., 13: 35, 1912 





Obs. 


Calc. 


o-c 


(o-c)-jc 


Gray 9 


525 
340 
194 


529 
265 
265 


- 4 
+75 
-71 


.030 


Gray c? 


21.226 


Yellow d 1 


19.023 










X 2 =40.279 



Here x 2 is over 40, the odds against the deviations, being due 
to errors of sampling, are so enormously great that it is idle to 
express them in figures. In short, the facts do not substantiate 
the hypothesis, and Professor Morgan has himself suggested 
possible reasons for the disagreement. 



Illustration V. Partial Gametic Coupling in Sweet Peas 
Bateson, Saunders and Punnett, Rep. Evol. Com., 4: 11 



Observed 

Number of 

Cases 



Calculated on 

7:1:1:7 

Basis 



Calculated on 

7:1:1:7 

Basis 



Purple, long. 
Purple, round 
Red, long. . . . 
Red, round . . 



493 
25 
25 

138 



471 
40 
40 

130 



490 
20 
20 

151 



For the 7:1:1:7 basis, x 2 = 12.7699, P==.0053. For the 
15:1:1:15 hypothesis, x 2 = 3.6375, P = .3086. Thus the 
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chances are about 995 : 5 or 199 : 1 against the validity of the 
first hypothesis and only 69 : 31, or about 2 : 1, against the 
second. 

Illustration VI. Color Inheritance in Antirrhinum 
Wheldale, Marryat and Sollas, Bep. Evol. Com., 15: 15 





Obs. 


Calc. 


o-c 


(o-6') 2 /e 


I. Magenta 


399 

122 
121 
38 
88 
35 
33 
19 


361 

120 

120 

40 

120 

40 

40 

14 


+38 
+ 2 
+ 1 
_ 2 
-32 

— 5 

- 7 
+ 5 


4.000 
.033 


I. crimson 

I. crimson delila 

T. ivory 


.008 

.100 

9.075 


T, ivory delila 

T. yellow 


.625 
1.255 




1.786 








855 


855 


X 2 


= 16.852 



Hence, P = .0185 or the chances are about 980:20 against such 
discrepancies being chance deviations from the theory. Thus 
either the theory must be discarded or reasons for the discrep- 
ancies found. 

A conspicuous advantage of this method of Pearson is that in 
its application the deviation of observation from theory for 
each class and the amount which this discrepancy contributes 
to x 2 are under the worker's eye. 

If used with the caution that should be exercised in the draw- 
ing of any conclusion from probable errors, 9 I believe that this 
well-known criterion of goodness of fit will prove most useful 
to Mendelians. 

J. Arthur Harris 

9 Some biologists apparently seem to feel that the calculation of a statis- 
tical "probable error" covers all the biological sins which may be com- 
mitted in the collection or manipulation of their data. 



